18.657: Mathematics of Machine Learning

نویسندگان

  • Philippe Rigollet
  • Haihao Lu
چکیده

3.1 Setup The stochastic multi-armed bandit is a classical model for decision making and is defined as follows: There are K arms(different actions). Iteratively, a decision maker chooses an arm k ∈ {1, . . . ,K}, yielding a sequence XK,1, . . . ,XK,t, . . ., which are i.i.d random variables with mean μk. Define μ∗ = maxj μj or ∗ ∈ argmax. A policy π is a sequence {πt}t≥1, which indicates which arm to be pulled at time t. πt ∈ {1, . . . ,K} and it depends only on the observations strictly interior to t. The regret is then defined as:

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Forecasting the Tehran Stock market by Machine ‎Learning Methods using a New Loss Function

Stock market forecasting has attracted so many researchers and investors that ‎many studies have been done in this field. These studies have led to the ‎development of many predictive methods, the most widely used of which are ‎machine learning-based methods. In machine learning-based methods, loss ‎function has a key role in determining the model weights. In this study a new loss ‎function is ...

متن کامل

Comparative Analysis of Machine Learning Algorithms with Optimization Purposes

The field of optimization and machine learning are increasingly interplayed and optimization in different problems leads to the use of machine learning approaches‎. ‎Machine learning algorithms work in reasonable computational time for specific classes of problems and have important role in extracting knowledge from large amount of data‎. ‎In this paper‎, ‎a methodology has been employed to opt...

متن کامل

Similarity measurement for describe user images in social media

Online social networks like Instagram are places for communication. Also, these media produce rich metadata which are useful for further analysis in many fields including health and cognitive science. Many researchers are using these metadata like hashtags, images, etc. to detect patterns of user activities. However, there are several serious ambiguities like how much reliable are these informa...

متن کامل

Machine learning algorithms in air quality modeling

Modern studies in the field of environment science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. The recent advances in statistical modeling based on machine learning approaches have emerged as solution to tackle these issues. It is a fact that, input variable type largely affec...

متن کامل

Some Perspectives of Machine Repair Problems

In this article, we survey machine repair problems (MRP) with an emphasis on historical developments of queuing models of practical importance. The survey proceeds historically, starting with developments in 1985, when the first published review on machine interference models appeared. We attempt to elaborate some basic MRP models of the real life congestion situations. The brief survey of some...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015